A regionalized national universal kriging model using Partial Least Squares regression for estimating annual PM2.5 concentrations in epidemiology.
نویسندگان
چکیده
Many cohort studies in environmental epidemiology require accurate modeling and prediction of fine scale spatial variation in ambient air quality across the U.S. This modeling requires the use of small spatial scale geographic or "land use" regression covariates and some degree of spatial smoothing. Furthermore, the details of the prediction of air quality by land use regression and the spatial variation in ambient air quality not explained by this regression should be allowed to vary across the continent due to the large scale heterogeneity in topography, climate, and sources of air pollution. This paper introduces a regionalized national universal kriging model for annual average fine particulate matter (PM2.5) monitoring data across the U.S. To take full advantage of an extensive database of land use covariates we chose to use the method of Partial Least Squares, rather than variable selection, for the regression component of the model (the "universal" in "universal kriging") with regression coefficients and residual variogram models allowed to vary across three regions defined as West Coast, Mountain West, and East. We demonstrate a very high level of cross-validated accuracy of prediction with an overall R2 of 0.88 and well-calibrated predictive intervals. In accord with the spatially varying characteristics of PM2.5 on a national scale and differing kriging smoothness parameters, the accuracy of the prediction varies by region with predictive intervals being notably wider in the West Coast and Mountain West in contrast to the East.
منابع مشابه
A National Prediction Model for PM2.5 Component Exposures and Measurement Error–Corrected Health Effect Inference
BACKGROUND Studies estimating health effects of long-term air pollution exposure often use a two-stage approach: building exposure models to assign individual-level exposures, which are then used in regression analyses. This requires accurate exposure modeling and careful treatment of exposure measurement error. OBJECTIVE To illustrate the importance of accounting for exposure model character...
متن کاملNational-Scale Estimates of Ground-Level PM2.5 Concentration in China Using Geographically Weighted Regression Based on 3 km Resolution MODIS AOD
High spatial resolution estimating of exposure to particulate matter 2.5 (PM2.5) is currently very limited in China. This study uses the newly released nationwide, hourly PM2.5 concentrations to create a nationwide, geographically weighted regression (GWR) model to estimate ground-level PM2.5 concentrations in China. A3 km resolution aerosol optical depth (AOD) product from MODIS is used as the...
متن کامل1 Fusing Observational , Satellite Remote Sensing and 2 Air Quality Model Simulated Data to Estimate 3 Spatiotemporal Variations of PM 2 . 5 Exposure in 4 China 5
Estimating ground surface PM2.5 with fine spatiotemporal resolution is a critical technique 17 for exposure assessments in epidemiological studies of its health risks. Previous studies have 18 utilized monitoring, satellite remote sensing or air quality modeling data to evaluate the 19 spatiotemporal variations of PM2.5 concentrations, but such studies rarely combined these data 20 simultaneous...
متن کاملModeling of the Relationships Between Spatio-Temporal Changes of Traffic Volume and Particulate Matter-2.5 Pollutant Concentration Based on Geographically Weighted Regression (GWR) and Inverse Distance Weighting (IDW) Model: A Case Study in Tehran M
Background and Aim: High concentrations of particulate matter-25 (PM2.5) have been the cause of the unhealthiest days in Tehran, Iran in recent years. This study was conducted with the aim of the spatio-temporal analysis of traffic volume and its relationship with PM2.5 pollutant concentrations in Tehran metropolis, Tehran during 2015-2018, using the Geographic Information System (GIS). Materi...
متن کاملVariance Estimation in Spatial Regression Using a Nonparametric Semivariogram Based on Residuals
The empirical semivariogram of residuals from a regression model with stationary errors may be used to estimate the covariance structure of the underlying process. For prediction (kriging) the bias of the semivariogram estimate induced by using residuals instead of errors has only a minor eeect because the bias is small for small lags. However, for estimating the variance of estimated regressio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Atmospheric environment
دوره 75 شماره
صفحات -
تاریخ انتشار 2013